NSF PAR Search | NSF Public Access Repository

Learning from many trajectories

Tu, Stephen; Frostig, Roy; Soltanolkotabi, Mahdi (April 2024, Journal of Machine Learning Research)

We initiate a study of supervised learning from many independent sequences ("trajectories") of non-independent covariates, reflecting tasks in sequence modeling, control, and reinforcement learning. Conceptually, our multi-trajectory setup sits between two traditional settings in statistical learning theory: learning from independent examples and learning from a single auto-correlated sequence. Our conditions for efficient learning generalize the former setting--trajectories must be non-degenerate in ways that extend standard requirements for independent examples. Notably, we do not require that trajectories be ergodic, long, nor strictly stable. For linear least-squares regression, given n-dimensional examples produced by m trajectories, each of length T, we observe a notable change in statistical efficiency as the number of trajectories increases from a few (namely m<=n) to many (namely m>=n). Specifically, we establish that the worst-case error rate of this problem is n/(mT) whenever m>=n. Meanwhile, when m<=n, we establish a (sharp) lower bound of n^2/(m^2T) on the worst-case error rate, realized by a simple, marginally unstable linear dynamical system. A key upshot is that, in domains where trajectories regularly reset, the error rate eventually behaves as if all of the examples were independent, drawn from their marginals. As a corollary of our analysis, we also improve guarantees for the linear system identification problem.
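The two regimes stated in the abstract can be illustrated with a small helper. This is only a sketch of the quoted scalings: the function name `worst_case_rate` is hypothetical, and constants and any logarithmic factors are ignored, since the abstract does not state them.

```python
def worst_case_rate(n: int, m: int, T: int) -> float:
    """Order-of-magnitude worst-case error rate quoted in the abstract.

    n: covariate dimension, m: number of trajectories, T: trajectory length.
    Constants and log factors are omitted (an assumption of this sketch).
    """
    if min(n, m, T) <= 0:
        raise ValueError("n, m, and T must be positive")
    if m >= n:
        # Many-trajectory regime: rate n / (m T), as if all m*T examples
        # were independent draws.
        return n / (m * T)
    # Few-trajectory regime: the sharp lower bound n^2 / (m^2 T).
    return n**2 / (m**2 * T)


if __name__ == "__main__":
    # The two expressions coincide at m == n, where both equal 1 / T.
    for m in (2, 4, 8, 16):
        print(m, worst_case_rate(n=8, m=m, T=100))
```

With n fixed, the rate falls as 1/m^2 while m is below n and as 1/m once m is at least n, which is the change in statistical efficiency the abstract describes.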
Full Text Available
Sharp Rates in Dependent Learning Theory: Avoiding Sample Size Deflation for the Square Loss

Ziemann, Ingvar; Tu, Stephen; Pappas, George J; Matni, Nikolai (May 2024, ICML 2024)

Full Text Available
Learning Robust Output Control Barrier Functions From Safe Expert Demonstrations

https://doi.org/10.1109/OJCSYS.2024.3385348

Lindemann, Lars; Robey, Alexander; Jiang, Lejun; Das, Satyajeet; Tu, Stephen; Matni, Nikolai (January 2024, IEEE Open Journal of Control Systems)

Full Text Available
The noise level in linear regression with dependent data

Ziemann, Ingvar; Tu, Stephen; Pappas, George J.; Matni, Nikolai (September 2023, NeurIPS - OpenReview)

Full Text Available
Multi-task Imitation Learning for Linear Dynamical Systems

Zhang, Thomas T.; Kang, Katie; Lee, Bruce D.; Tomlin, Claire; Levine, Sergey; Tu, Stephen; Matni, Nikolai (July 2023, L4DC - PMLR)

Full Text Available
Robots That Ask For Help: Uncertainty Alignment for Large Language Model Planners

Ren, Allen Z; Dixit, Anushri; Bodrova, Alexandra; Singh, Sumeet; Tu, Stephen; Brown, Noah; Xu, Peng; Takayama, Leila; Xia, Fei; Varley, Jake; et al. (November 2023, Conference on Robot Learning (CoRL))

Large language models (LLMs) exhibit a wide range of promising capabilities -- from step-by-step planning to commonsense reasoning -- that may provide utility for robots, but remain prone to confidently hallucinated predictions. In this work, we present KnowNo, which is a framework for measuring and aligning the uncertainty of LLM-based planners such that they know when they don't know and ask for help when needed. KnowNo builds on the theory of conformal prediction to provide statistical guarantees on task completion while minimizing human help in complex multi-step planning settings. Experiments across a variety of simulated and real robot setups that involve tasks with different modes of ambiguity (e.g., from spatial to numeric uncertainties, from human preferences to Winograd schemas) show that KnowNo performs favorably over modern baselines (which may involve ensembles or extensive prompt tuning) in terms of improving efficiency and autonomy, while providing formal assurances. KnowNo can be used with LLMs out of the box without model-finetuning, and suggests a promising lightweight approach to modeling uncertainty that can complement and scale with the growing capabilities of foundation models.
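As a rough illustration of how conformal prediction can drive an "ask for help" decision, the sketch below uses generic split conformal prediction over multiple-choice options. It is not taken from the paper: the function names, the score definition (one minus the probability assigned to the correct option), and the rule "ask for help unless exactly one option remains" are all assumptions made for this example.

```python
import math


def conformal_threshold(cal_scores: list[float], eps: float) -> float:
    """Split-conformal quantile of calibration nonconformity scores.

    cal_scores: 1 - p(correct option) on held-out calibration tasks (assumed).
    eps: target error level; coverage of at least 1 - eps is the goal.
    """
    n = len(cal_scores)
    k = math.ceil((n + 1) * (1 - eps))  # finite-sample corrected rank
    if k > n:
        return float("inf")  # too few calibration points: keep every option
    return sorted(cal_scores)[k - 1]


def prediction_set(option_probs: dict[str, float], qhat: float) -> list[str]:
    """Keep every option whose nonconformity score is within the threshold."""
    return [opt for opt, p in option_probs.items() if 1.0 - p <= qhat]


def needs_help(options: list[str]) -> bool:
    """Hypothetical rule: ask a human unless exactly one option remains."""
    return len(options) != 1


if __name__ == "__main__":
    scores = [0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9]
    qhat = conformal_threshold(scores, eps=0.5)
    chosen = prediction_set({"A": 0.6, "B": 0.3, "C": 0.1}, qhat)
    print(qhat, chosen, needs_help(chosen))
```

A smaller `eps` raises the threshold, so more options survive and help is requested more often; that is the trade-off between statistical assurance and human effort that the abstract refers to.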
Full Text Available
On the Sample Complexity of Stability Constrained Imitation Learning

Tu, Stephen; Robey, Alexander; Zhang, Tingnan; Matni, Nikolai (January 2022, Learning for Dynamics and Control)

Full Text Available
TaSIL: Taylor Series Imitation Learning

Pfrommer, Daniel; Zhang, Thomas TCK; Tu, Stephen; Matni, Nikolai (January 2022, Conference on Neural Information Processing Systems)

Full Text Available
Adversarially Robust Stability Certificates can be Sample-Efficient

Zhang, Thomas; Tu, Stephen; Boffi, Nicholas; Slotine, Jean-Jacques; Matni, Nikolai (January 2022, Learning for Dynamics and Control)

Full Text Available